Extended Dependency Unification Grammar

نویسنده

  • Peter Hellwig
چکیده

A quick way of gaining a first impression of a formalism is looking at concrete examples. The sentence Arthur attends the Prague meeting. is turned into the following representation by the PLAIN parser: (UTTERANCE: assertion': clause type[main] (< PROPOSITION: attend: verb form[finite] tense[present] person[3rd] number[singular] (< SUBJECT: Arthur: noun person[3rd] number[singular]) (> DIROBJECT: meeting: noun (< DETERMINATION: definit': determiner sequence[1]) (< ATTRIBUTE: Prague: noun sequence[2])))); Fig. 1: Representation of a sentence Fig. 1 is an instance of the so-called Dependency Representation Language (DRL). DRL is used for all purposes in the PLAIN system, e.g. for representing parsing output, for storing information in a knowledge base, for writing grammars and lexica, for writing semantic rules, and as the source for generating natural language output. DRL is, as it were, the programming language of the linguist in order to make the system analyse and process a particular natural language. The expressions of DRL are lists (in the technical sence of list processing) consisting of terms; each sublist is included in parentheses, including sublists consisting of just one term. A list corresponds to a graph, in particular, to a tree. The structure of the tree in fig. 1 is elucidated by the indenting of the terms. Each term carries three kinds of labels: a role, a lexeme, and a complex category. The latter consists of a main category and a set of grammatical attributes followed by their values in square brackets. The symbols "<" and ">" indicate that the subsequent subtree corresponds to a segment in the input which is left or right to the segment corresponding to the term

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parsing Korean based on Dependency Grammar and GULP

This paper presents a parsing algorithm in Prolog using GULP, based on dependency grammar and unification-based grammar.1 It parses declarative sentences of a free-word-order language, Korean. The dependency grammar accepts free order of the words in a sentence. Unification-based features separate the grammar from the parsing algorithm and also simplify the notation of the grammar. GULP (Graph ...

متن کامل

CG-3 - Beyond Classical Constraint Grammar

This paper discusses methodological strengths and shortcomings of the Constraint Grammar paradigm (CG), showing how the classical CG formalism can be extended to achieve greater expressive power and how it can be enhanced and hybridized with techniques from other parsing paradigms. We present a new, largely theory­independent CG framework and rule compiler (CG­3), that allows the linguist to wr...

متن کامل

Encoding a syntactic dictionary into a super granular unification grammar

We show how to turn a large-scale syntactic dictionary into a dependency-based unification grammar where each piece of lexical information calls a separate rule, yielding a super granular grammar. Subcategorization, raising and control verbs, auxiliaries and copula, passivization, and tough-movement are discussed. We focus on the semantics-syntax interface and offer a new perspective on syntact...

متن کامل

Dependency Unification Grammar for PROLOG

The programming language PROLOG has proved to be an excellent tool for implementing natural language processing systems. Its built-in resolution and unification mechanisms are well suited to both accept and generate sentences of artificial and natural languages. Although supporting many different linguistic formalisms, its straightforwardness and elegance have perhaps best been demonstrated wit...

متن کامل

Tree Unification Grammar Problems and Proposals for Topology , TAG , and

This work presents a lexicalized grammar formalism which can be seen as a variant of multi-component tree adjoining grammar (TAG). This formalism is well-suited for describing the syntax of German because it relates a syntactic dependency graph with a hierarchy of topological domains. The topological phrase structure encodes the placement of verbal and nominal elements in the (ordered) field st...

متن کامل

Treebank-Based Acquisition of Multilingual Unification Grammar Resources

Deep unification(constraint-)based grammars are usually hand-crafted. Scaling such grammars from fragments to unrestricted text is time-consuming and expensive. This problem can be exacerbated in multilingual broad-coverage grammar development scenarios. Cahill et al. (2002, 2004) and O’Donovan et al. (2004) present an automatic f-structure annotation-based methodology to acquire broad-coverage...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993